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r Abstract 

H 

C/^ ■ This paper is motivated by the following observation. Take a 3 x 3 random (Haar distributed) 

. orthogonal matrix T, and use it to "rotate" the north pole, xq say, on the unit sphere in i?^. 

I This then gives a point u = Txq that is uniformly distributed on the unit sphere. Now use the 

same orthogonal matrix to transform n, giving v = Tu = T^xq. Simulations reported in Marzetta 
et al (2002) suggest that v is more likely to be in the northern hemisphere than in the southern 
hemisphere, and, morever, that w = T^xq has higher probability of being closer to the poles itxo 
^ ■ than the uniformly distributed point u. In this paper we prove these results, in the general setting 

OO I of dimension p > 3, by deriving the exact distributions of the relevant components of u and v. The 

essential questions answered are the following. Let x be any fixed point on the unit sphere in R^, 

■ where p > 3. What are the distributions of U2 = x'T'^x and U3 = x'T^x? It is clear by orthogonal 
invariance that these distribution do not depend on x, so that we can, without loss of generality, 
take X to be xq = (1, 0, 0)' G R^. Call this the "north pole" . Then XqT'^xo is the first component 

op . of the vector T'^xq. We derive stochastic representations for the exact distributions of U2 and Us 

in terms of random variables with known distributions. 

^ , 1. Introduction 

c5 ' Random orthogonal matrices have long played an important role in multivariate statistical analysis 

(see, for example, James (1954), Muirhead (1982)); more recently such matrices have received 
attention in the encryption and signal processing literature (see Sloane (1983), Marzetta et al 
(2002)). Although the uniform (Haar) distribution on the group Op of p x p orthogonal matrices 
is fairly well understood, questions regarding powers of uniformly distributed random orthogonal 
matrices have arisen in the past few years. For Op, as well as for other classical compact Lie 
groups. Rains (1997) provides an intriguing description of the distribution of high powers of Haar 
distributed group elements. Some of the results in Rains (1997) are reproduced in Marzetta et 
al (2002) where it is demonstrated via numerically constructed graphic displays what happens on 
O3 for low powers of Haar distributed orthogonals. For example, suppose F is Haar distributed 
on O3 and xq = (1,0,0)' is a unit vector in R^. Figure 3 of Marzetta et al (2002, p. 947)) 
shows the curious result that simulated observations on T^xq bunch up around xq (compared to 
the uniformly distributed vector Txq), while simulated observations on T^xq bunch up around ±xq. 
This phenomenon is an example of what we call the "north pole problem" . 
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To justify our "north pole" nomenclature and describe the results below, suppose that T is 
uniformly distributed on Op, where p > S. Consider a column vector x G with x'x = ||aj|p = 1. 
For a positive integer k, let 

Uk = x'T'^x. (1.1) 

Obviously Uk has a distribution that depends on p, but we will suppress this dependence in our 
notation. In what follows, we will use the "law" notation, £(■), to denote the distribution or 

probability law of " • ", where " • "can be a random variable, a random vector, or a random matrix. 
In this notation, the uniform distribution on Op is characterized by its invariance, namely 



c{r) = c{gr) = c{rh), (1.2) 

for all for all g,h G Op. Equation (1.2) immediately implies 

£{x'r''x) = Ciy'V^y) (1.3) 



for all x,y G i?^ with ||a;|| = \\y\\ = 1. Therefore the distribution of C/fc in (1.1) does not depend on 
x, so in what follows we simply take 



X = Xq 



1'^ 



(1.4) 



We will then arbitrarily call xq the "north pole" on the unit sphere Sp = {x £ R^\\\x\\ = 1}. Next, 
observe that a;oa;o(r*'a;o) is the orthogonal projection of T'^Xq onto the 1-dimensional subspace 
spanjxo}. Therefore the random variable \Uk\ is just the length of the orthogonal projection of 
r^xo onto the span of the north pole. For this reason, we refer to the problem of finding C{Uk) as 
the "north pole problem" . 

The results in this paper give the exact distribution of C/fc for fc = 2, 3 and for all p > S. Our 
results are expressed in terms of random variables with known distributions, and this facilitates 
the simulation of C{Uk) for p fixed. For example, when k = 1, Ui is the (1, 1) element of F and 
this is well known to have the probability density on given by 

/(^|p) = -(1 - x2)(P-3)/2, \x\ < 1. (1.5) 

'^^ r(i)r(i(p-i))^ ; - M 

Of course, C/f has the beta distribution Beta(|, ^{p — 1)) with density function 

9o{y) = wiJ/y u^ y'~'^^ - y)^^'"'^"'' < y < 1. (1.6) 

^(2)^(2^- 1)) 



Our approach to finding the distributions of U2 and U3 is based on some recent results in 
Eaton and Muirhead (2008). These results are expressed in terms of a decomposition of the Haar 
distribution corresponding to the partition 
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where Fn is 1 x 1, is 1 x (p- 1), is (p-l) x 1 and r22 is {p— 1) x (p—l). In particular, >C(rii) 
is specified by the density (1.5). Eaton and Muirhead (2008) give the conditional distribution of 
(r2i,ri2) given Fii and the conditional distribution of r22 given (Fn, r2i, ri2). These results are 
summarized in Section 2 of this paper. 

The results in Section 2 are then used in Section 3 to give random variable presentations of 
C{U2) and C{U^). For example, here is the result for U2- 

Theorem 1.1: Let and ,^2 be independent random variables, where ^1 has density f{-\p) given 
in (1.5) and ^2 has density f{-\p— 1). Then 

£(c/2)=/:(e? + (i-e?)6). (1.8) 

The representation (1.8), which is derived in Section 3, shows that U2 can be thought of as 
a random convex combination (with weights ^\ and (1 — ^1)) of a random variable that is 1 and 
the random variable ^2. This helps explain the Marzetta et al (2002) simulation result that, when 
p = ?>,U2 exhibits some bunching near 1. Here are values of P{U2 > 0) for various dimensions p: 

Dimension p 3 4 5 10 20 50 100 500 

P{U2 > 0) 0.71 0.68 0.66 0.62 0.59 0.56 0.54 0.52 

These probabilities suggest convergence to 1/2 as p —> 00. In fact, it is straightforward to show 
that Ciy^U2) N{0, 1) as p ^ 00. 

The distribution U3 is somewhat more complicated than that of U2, but is still expressible in 
terms of independent random variables. It is given in Section 3. 

Powers of F have received attention in the literature, especially concerning the distribution 
of the eigenvalues of F*^, k = 1,2,.... For example, see Diaconis and Shahshahani (1994), Rains 
(1997), Marzetta et al (2002), and Diaconis (2003). 

2. A Decomposition Theorem 

Let F have the Haar distribution on Op and partition F as in (1.7). In this section, we describe a 
recent result of Eaton and Muirhead (2008) that characterizes C{T). The basic idea is to specify 

(i) ^(rii) 

(ii) £(F2i,Fi2|Fn) (2.1) 
{iii) >C(F22|Fii,F2i,Fi2) 

where (ii) and {Hi) in (2.1) refer to obvious conditional distributions. It is clear that >C(F) is 
completely determined by (i), {ii) and {Hi) in (2.1). 

In order to present a rigorous statement concerning the above conditional distributions, it is 
useful to modify the sample space for F slightly. Let 

= {g\ge Op, gne {-1,1)}, (2.2) 

where gn is the (1,1) element of the matrix g. Note that has probability one under the Haar 

distribution on Op. 

Recall that Sp is the unit sphere in R^. By the uniform distribution on Sp we mean the unique 
probability distribution on Sp that is invariant under the action of Op on Sp (see Eaton (1989), 
Chapter 2). 
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Theorem 2.1: Suppose that F has the Haar distribution on 0+ so Fn G (—1, 1). Let Vi and V2 
be independent uniformly distributed vectors in Sp-i. A version of the conditional distribution of 
(F2i,Fi2) given Fu is 



/:(F2i, FislFn) = jC-i^l - rl,V,, ^1 - rl,V2) (2.3) 

where Fu is fixed. 

In words, Theorem 2.1 means that given Fu, the two functions (1 — Ff^)~"^/^F2i and (1 — 
F^]^)~^/^F']^2 of r are independent and uniformly distributed on Sp-i. A proof of this is given in 
Eaton and Muirhead (2008). 

To specify the conditional distribution {in) in (2.1), let 

As stated above, (Wi,VF2) is independent of Fu. Further, Wi,W2 are iid uniform on 5p_i. 
Since (Fii,F2i,Fi2) and (Fu, PFi, VF2) are bimeasurable functions of each other, conditioning on 
(Fii, F21, F12) or (Fii,VFi,W^2) are equivalent. Next, let hi G Op-i have first column Wi, i = 1,2, 
so 

hiei = Wi, 1 = 1,2 

where ei is the first standard unit vector in RP~^. Then fill out the remaining columns of hi, 
i = 1,2 according to Proposition A. 2 in Eaton and Muirhead (2008) (with u = ei and v = Wi). 

Theorem 2.2: Suppose F is Haar distributed on Op. Let A be Haar distributed on Op-2. A 
version of the conditional distribution of F22 given (Fu, F2i,Fi2) is provided by the distribution 

with hi and /i2 fixed and specified above. 

The proof of this result can be found in Eaton and Muirhead (2008). One consequence of 
Theorems 2.1 and 2.2 is that the Haar distribution on Op can be generated by the Haar distribution 
on Op-2, the uniform distribution on Sp-\ and the density f{-\p). ■ Indeed, the following algorithm 

for generating F establishes this claim: 

1. First draw Fu from f{-\p) given in (1.5). 

2. Next draw iid Ui and U2 which are uniform on Sp-i and set 



2i = ^l-V\^Ui, V'i2 = sJl-V\^U2. 

3. Then construct the (p — 1) x (p — 1) orthogonal matrices hi and h2 (with Wi = Ui, i = 1,2) 
as above. 

4. Draw A uniformly from Op-2. 
With 



A 



22 



-Fn 
A 



the random matrix 



Fii F12 

F21 hiA22h'2 

is uniform on Op. This assertion is a direct consequence of Theorems 2.1 and 2.2. 



4 



3. The North Pole Distribution 

Here we use the results described in Section 2 to give the distribution of Uk for A; = 2,3. The 
notation estabhshed in (1.7) and (2.5) is used below. In this notation, some algebra shows that 

U2 = rj, + (1 - rj,)w^Wi (3.1) 

and 

U3 = r?i + 2ru(i - Tl^)w^Wi + (1 - Tl^)wir22Wi. (3.2) 

We begin with a lemma concerning uniform random vectors on Sp-i. 
Lemma 3.1: Suppose Vi and V2 are independent and uniform on iSp_i. Then the random variable 

6 = ^i'^2 (3.3) 

has the density f{-\p — 1) defined in (1.5). 
Proof: Because jO.{gVi) = £.{Vi) for all g G Cp-i, 

jr{^2)=C{igViyV2), gGOp.,. (3.4) 

Now condition on Vi and pick g so gVi = € 5p-i, where z'^ = (1,0, ...,0). Then the conditional 
distribution of ^2 given Vi is the same as V2 which is just the first element of the uniform random 
vector V2. That this element has density f{-\p— 1) is easily established. Since this distribution 
does not depend on Vi, the conclusion of the lemma follows. 

The proof of Theorem 1.1 is now immediate. From Theorem 2.1, the random variables Wi 
and W2 are independent of Fn and are iid uniform on Sp-i. Since ^1 = Fn has density f{-\p) the 
representation (1.8) now follows from (3.1) and Lemma 3.1. 

To begin the discussion of C{U^)^ we first establish the following lemma. 

Lemma 3.2: Fix Fn G (—1, 1) and let A be uniform on Op-2- Also, let u and v be fixed vectors 
in Sp-i with ui and vi denoting the first element of u and v respectively. Then 

-^(^'("o" a) ^) = -^(-^i^i^n + V'(l - u?)(l - vl)i^). (3.5) 

where .^3 has density f{-\p — 2). 

Proof: Since A is uniform on Op-2, jO{gAh) = JC.{A) for all g,h & Op-2- This implies that 
where 

"•"(J D' "'={1 1)- f-''^^-- 

Therefore, for g,h e Op-2, 

5 



It is clear that we can now pick g* and h* so that 



9 u 



( 





V 



= u, h*u = 



( 





V 



= V. 



J 



With these choices for g* and /i*, 



:(u' 



-rn 

A 



=c{u' 



-Til 
A 



).) 



= C{-uiViTii + J {I - ul){l - vD^ii) 



where An is the (1,1) element of A. Since A is uniform on Op-2-, it follows immediately that 
^3 = All has density f{-\p — 2). This completes the proof of Lemma 3.2. 

Next, the conditional distribution of C/3 in (3.2), given (rii,r2i,ri2), is described in the 
following lemma. 

Lemma 3.3: A version of the conditional distribution of given (Ln, r2i, ri2) is 

c{t\^ + 2rn(l - Tl^)W!,Wi + (1 - Vl^)[-{W!,WifVii + (1 - {W!,Wif)^) (3.8) 

where ^3 has density f{-\p — 2) and all the remaining quantities in (3.8) are fixed. 

Proof: In the notation of Theorem 2.2, recall that the matrix hi has first column Wi, i = 1,2 and 
a version of the conditional distribution of r22 given (Tn, r2i, ri2) is 



(3.9) 



with A uniform on Op-2- Therefore a version of the conditional distribution of W2T22W1, given 

(rii,r2i,ri2) , is 







A 



KWi 



(3.10) 



with everything fixed except A. Now, we apply Lemma 3.2 with u = h'-^W2 and v = h'2Wi. From 
the definition of hi and /12, it is clear that the first element of both u and v is ui = vi = W2W1. 
Prom this and (3.5), we conclude that (3.10) is just 



jc[-{w^Wifrii + (1 - {w^Wif)Cs). 



This completes the proof of Lemma 3.3. 

Theorem 3.4: Let have density f{-\p + 1 — i) for i = 1, 2, 3 where ^1, ^2, and ^3 are mutually 
independent. Then 



c{Us) =c[ei + 26(1 - e?)6 + (1 - ei) Hie2 + (i - 



(3.11) 



Proof: From Lemma 3.3, a version of the conditional distribution of C/3 given (rii,r2i,ri2) is 
provided by (3.8). In (3.8), ^3 is independent of (Fn, r2i, ri2). Setting ^1 = Fn and ^2 = W2W1, it 
follows that has density f{-\p+l—i), for i = 1,2, 3. From Theorem 2.1, ^1 and ^2 are independent. 
By construction, ^3 is independent of (^1,^2) so joint independence follows. Unconditioning now 
yields (3.11) as €{1/3). This completes the proof. 

The results of Theorem 3.4 provide an easy method to simulate C{Uj,). One simply draws 
^1, ^2, and ^3 independently as indicated and calculates the algebraic expression in these variables 
given in (3.11). Note that if Zi, . . . , Zp are iid A^(0, 1) variables, then the random variable 

has the density f{-\p). 

Finally, we note that J~-{^U3) ^ A^(0, 1) as p ^ 00. We conjecture that a similar result holds 
for Uk, for any fixed k. The method of moments technique used by Krishnapur (2007) in his proof 
of Lemma 10 could most likely be used to establish this, but we have not carried out the details. 
Our proof for the cases of A; = 1, 2, 3 does not appear to generalize. 
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